AITopics | computational notebook

Collaborating Authors

computational notebook

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

NoteEx: Interactive Visual Context Manipulation for LLM-Assisted Exploratory Data Analysis in Computational Notebooks

Payandeh, Mohammad Hasan, Yuan, Lin-Ping, Zhao, Jian

arXiv.org Artificial IntelligenceNov-11-2025

Computational notebooks have become popular for Exploratory Data Analysis (EDA), augmented by LLM-based code generation and result interpretation. Effective LLM assistance hinges on selecting informative context -- the minimal set of cells whose code, data, or outputs suffice to answer a prompt. As notebooks grow long and messy, users can lose track of the mental model of their analysis. They thus fail to curate appropriate contexts for LLM tasks, causing frustration and tedious prompt engineering. We conducted a formative study (n=6) that surfaced challenges in LLM context selection and mental model maintenance. Therefore, we introduce NoteEx, a JupyterLab extension that provides a semantic visualization of the EDA workflow, allowing analysts to externalize their mental model, specify analysis dependencies, and enable interactive selection of task-relevant contexts for LLMs. A user study (n=12) against a baseline shows that NoteEx improved mental model retention and context selection, leading to more accurate and relevant LLM responses.

artificial intelligence, large language model, natural language, (15 more...)

arXiv.org Artificial Intelligence

2511.07223

Country:

North America > United States (1.00)
North America > Canada > Ontario (0.28)
Asia > Japan > Honshū > Kantō (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
Personal > Interview (0.67)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Flowco: Rethinking Data Analysis in the Age of LLMs

Freund, Stephen N., Simon, Brooke, Berger, Emery D., Jun, Eunice

arXiv.org Artificial IntelligenceApr-22-2025

Conducting data analysis typically involves authoring code to transform, visualize, analyze, and interpret data. Large language models (LLMs) are now capable of generating such code for simple, routine analyses. LLMs promise to democratize data science by enabling those with limited programming expertise to conduct data analyses, including in scientific research, business, and policymaking. However, analysts in many real-world settings must often exercise fine-grained control over specific analysis steps, verify intermediate results explicitly, and iteratively refine their analytical approaches. Such tasks present barriers to building robust and reproducible analyses using LLMs alone or even in conjunction with existing authoring tools (e.g., computational notebooks). This paper introduces Flowco, a new mixed-initiative system to address these challenges. Flowco leverages a visual dataflow programming model and integrates LLMs into every phase of the authoring process. A user study suggests that Flowco supports analysts, particularly those with less programming experience, in quickly authoring, debugging, and refining data analyses.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2504.14038

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
North America > United States > Massachusetts (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Energy (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback

Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks

Grotov, Konstantin, Borzilov, Artem, Krivobok, Maksim, Bryksin, Timofey, Zharov, Yaroslav

arXiv.org Artificial IntelligenceOct-18-2024

Computational notebooks became indispensable tools for research-related development, offering unprecedented interactivity and flexibility in the development process. However, these benefits come at the cost of reproducibility and an increased potential for bugs. With the rise of code-fluent Large Language Models empowered with agentic techniques, smart bug-fixing tools with a high level of autonomy have emerged. However, those tools are tuned for classical script programming and still struggle with non-linear computational notebooks. In this paper, we present an AI agent designed specifically for error resolution in a computational notebook. We have developed an agentic system capable of exploring a notebook environment by interacting with it -- similar to how a user would -- and integrated the system into the JetBrains service for collaborative data science called Datalore. We evaluate our approach against the pre-existing single-action solution by comparing costs and conducting a user study. Users rate the error resolution capabilities of the agentic system higher but experience difficulties with UI. We share the results of the study and consider them valuable for further improving user-agent collaboration.

agent, artificial intelligence, natural language, (16 more...)

arXiv.org Artificial Intelligence

2410.14393

Genre:

Questionnaire & Opinion Survey (0.90)
Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

BISCUIT: Scaffolding LLM-Generated Code with Ephemeral UIs in Computational Notebooks

Cheng, Ruijia, Barik, Titus, Leung, Alan, Hohman, Fred, Nichols, Jeffrey

arXiv.org Artificial IntelligenceJul-11-2024

Programmers frequently engage with machine learning tutorials in computational notebooks and have been adopting code generation technologies based on large language models (LLMs). However, they encounter difficulties in understanding and working with code produced by LLMs. To mitigate these challenges, we introduce a novel workflow into computational notebooks that augments LLM-based code generation with an additional ephemeral UI step, offering users UI scaffolds as an intermediate stage between user prompts and code generation. We present this workflow in BISCUIT, an extension for JupyterLab that provides users with ephemeral UIs generated by LLMs based on the context of their code and intentions, scaffolding users to understand, guide, and explore with LLM-generated code. Through a user study where 10 novices used BISCUIT for machine learning tutorials, we found that BISCUIT offers users representations of code to aid their understanding, reduces the complexity of prompt engineering, and creates a playground for users to explore different variables and iterate on their ideas.

ephemeral uis, iscuit, tutorial, (14 more...)

arXiv.org Artificial Intelligence

2404.07387

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (1.00)
Workflow (0.70)
Instructional Material > Course Syllabus & Notes (0.68)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks

Grotov, Konstantin, Titov, Sergey, Zharov, Yaroslav, Bryksin, Timofey

arXiv.org Artificial IntelligenceMar-26-2024

Computational notebooks became indispensable tools for research-related development, offering unprecedented interactivity and flexibility in the development process. However, these benefits come at the cost of reproducibility and an increased potential for bugs. There are many tools for bug fixing; however, they are generally targeted at the classical linear code. With the rise of code-fluent Large Language Models, a new stream of smart bug-fixing tools has emerged. However, the applicability of those tools is still problematic for non-linear computational notebooks. In this paper, we propose a potential solution for resolving errors in computational notebooks via an iterative LLM-based agent. We discuss the questions raised by this approach and share a novel dataset of computational notebooks containing bugs to facilitate the research of the proposed approach.

agent, computational notebook, notebook, (11 more...)

arXiv.org Artificial Intelligence

2405.01559

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Middle East > Cyprus (0.05)
Europe > Germany (0.05)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.96)

Add feedback

SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks

Wang, Zijie J., Munechika, David, Lee, Seongmin, Chau, Duen Horng

arXiv.org Artificial IntelligenceMay-4-2023

Computational notebooks such as Jupyter Notebook have become data scientists' de facto programming environments. Many visualization researchers and practitioners have developed interactive visualization tools that support notebooks. However, little is known about the appropriate design of visual analytics (VA) tools in notebooks. To bridge this critical research gap, we investigate the design strategies in this space by analyzing 159 notebook VA tools and their users' feedback. Our analysis encompasses 62 systems from academic papers and 103 systems sourced from a pool of 55k notebooks containing interactive visualizations that we obtain via scraping 8.6 million notebooks on GitHub. We also examine findings from 15 user studies and user feedback in 379 GitHub issues. Through this work, we identify unique design opportunities and considerations for future notebook VA tools, such as using and manipulating multimodal data in notebooks as well as balancing the degree of visualization-notebook integration. Finally, we develop SuperNOVA, an open-source interactive tool to help researchers explore existing notebook VA tools and search for related work.

artificial intelligence, human computer interaction, va tool, (15 more...)

arXiv.org Artificial Intelligence

2305.03039

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Oceania > Australia (0.04)
North America > United States > California (0.04)
Asia > Turkmenistan > Ahal Region > Anau (0.04)

Genre: Research Report (0.82)

Industry:

Information Technology (0.68)
Education (0.46)

Technology:

Information Technology > Visualization (1.00)
Information Technology > Software (1.00)
Information Technology > Human Computer Interaction > Interfaces (1.00)
(2 more...)

Add feedback

Telling Stories from Computational Notebooks: AI-Assisted Presentation Slides Creation for Presenting Data Science Work

Zheng, Chengbo, Wang, Dakuo, Wang, April Yi, Ma, Xiaojuan

arXiv.org Artificial IntelligenceJan-26-2023

Creating presentation slides is a critical but time-consuming task for data scientists. While researchers have proposed many AI techniques to lift data scientists' burden on data preparation and model selection, few have targeted the presentation creation task. Based on the needs identified from a formative study, this paper presents NB2Slides, an AI system that facilitates users to compose presentations of their data science work. NB2Slides uses deep learning methods as well as example-based prompts to generate slides from computational notebooks, and take users' input (e.g., audience background) to structure the slides. NB2Slides also provides an interactive visualization that links the slides with the notebook to help users further edit the slides. A follow-up user evaluation with 12 data scientists shows that participants believed NB2Slides can improve efficiency and reduces the complexity of creating slides. Yet, participants questioned the future of full automation and suggested a human-AI collaboration paradigm.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3491102.3517615

2203.11085

Country:

North America > United States > District of Columbia > Washington (0.05)
Asia > China > Hong Kong (0.04)
North America > United States > Virginia (0.04)
(4 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Personal > Interview (1.00)
Research Report > New Finding (0.92)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.88)

Add feedback

Towards Multifaceted Human-Centered AI

Rahman, Sajjadur, Kim, Hannah, Zhang, Dan, Hruschka, Estevam, Kandogan, Eser

arXiv.org Artificial IntelligenceJan-9-2023

Human-centered AI workflows involve stakeholders with multiple roles interacting with each other and automated agents to accomplish diverse tasks. In this paper, we call for a holistic view when designing support mechanisms, such as interaction paradigms, interfaces, and systems, for these multifaceted workflows.

artificial intelligence, natural language, proceedings, (16 more...)

arXiv.org Artificial Intelligence

2301.03656

Country:

North America > United States > New York > New York County > New York City (0.07)
Europe > Sweden > Uppsala County > Uppsala (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.72)
Information Technology > Human Computer Interaction > Interfaces (0.50)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.50)
Information Technology > Artificial Intelligence > Natural Language (0.48)

Add feedback

MEGAnno: Exploratory Labeling for NLP in Computational Notebooks

Zhang, Dan, Kim, Hannah, Chen, Rafael Li, Kandogan, Eser, Hruschka, Estevam

arXiv.org Artificial IntelligenceJan-8-2023

We present MEGAnno, a novel exploratory annotation framework designed for NLP researchers and practitioners. Unlike existing labeling tools that focus on data labeling only, our framework aims to support a broader, iterative ML workflow including data exploration and model development. With MEGAnno's API, users can programmatically explore the data through sophisticated search and automated suggestion functions and incrementally update task schema as their project evolve. Combined with our widget, the users can interactively sort, filter, and assign labels to multiple items simultaneously in the same notebook where the rest of the NLP project resides. We demonstrate MEGAnno's flexible, exploratory, efficient, and seamless labeling experience through a sentiment analysis use case.

annotation, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2301.03095

Country:

North America > United States > New York > New York County > New York City (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.34)

Add feedback

Documentation Matters: Human-Centered AI System to Assist Data Science Code Documentation in Computational Notebooks

#artificialintelligenceJul-4-2022, 09:20:52 GMT

Computational notebooks allow data scientists to express their ideas through a combination of code and documentation. However, data scientists often pay attention only to the code, and neglect creating or updating their documentation during quick iterations. Inspired by human documentation practices learned from 80 highly-voted Kaggle notebooks, we design and implement Themisto, an automated documentation generation system to explore how human-centered AI systems can support human data scientists in the machine learning code documentation scenario. Themisto facilitates the creation of documentation via three approaches: a deep-learning-based approach to generate documentation for source code, a query-based approach to retrieve online API documentation for source code, and a user prompt approach to nudge users to write documentation. We evaluated Themisto in a within-subjects experiment with 24 data science practitioners, and found that automated documentation generation techniques reduced the time for writing documentation, reminded participants to document code they would have ignored, and improved participants' satisfaction with their computational notebook.

assist data science code documentation, documentation, human-centered ai system, (6 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback